SINAI experience at CLEF

نویسندگان

  • Fernando Martínez Santiago
  • Luis Alfonso Ureña López
چکیده

This paper reports our work on CLEF and CrossLanguage Information Retrieval using CLEF resources. We aim to construct a highly languageindependent CLIR model. To accomplish this objective, several problems must be overcome: text translation or pseudo-translation and merging the obtained results for each language for a given query. Three issues of text-translation are investigated: the impact of translation probabilities, automatic multi-word recognition, and the generation of similarity thesauri from a Web corpus. Because the proposed model is query-translation driven, it is necessary to merge several monolingual results in a unique multilingual list of documents. To accomplish this task, we propose a new approach, which we call 2-step RSV, and we show that it performs better than more traditional approaches.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SINAI at QA@CLEF 2007. Answer Validation Exercise

This paper describes the rst participation of the SINAI (Intelligent Systems of Access Information) group of the University of Jaén in the AVE task of QA@CLEF 2007. We have developed a system made up of training and classi cation processes, that uses machine learning methods (bbr, timbl). Based on lexical features it obtains good results, a 41% of QA accuracy.

متن کامل

SINAI at CL-SR Task at CLEF 2007

This paper describes the first participation of the SINAI team in the CLEF 2007 CLSR track. This year, we only want to establish a first contact with the task and the collections. Thus, we have pre-processed the collection using the Information Gain technique in order to filter the labels with most relevant information. We have used the LEMUR toolkit as the Information Retrieval system in our e...

متن کامل

SINAI at ImageCLEF 2009 WikipediaMM Task

This paper describes the first participation of the SINAI team in the CLEF 2009 wikipediaMM task. This year, we only want to establish a first contact with the task and the collections. Thus, we have generated a new collection expanding with WordNet terms in order to perform the information included in this collection. In addition, we have expanded de queries with WordNet too. We have used the ...

متن کامل

SINAI at INFILE 2009: Experiments with Google News

This paper describes the SINAI team participation in the INFILE routing and filtering track of the CLEF campaign. This is the first participation of the SINAI research group in the INFILE task. We have participated in the batch filtering subtask and submitted two experiments: one using the topics’ text as learning data to train a classifier, and another one where training data has been construc...

متن کامل

SINAI at ImagePhoto 2009

This paper presents the fourth participation of the SINAI group, University of Jaén, in the Photo Retrieval task at Image CLEF 2009. Our system uses only the text of the queries, and a clustering system (based on kmeans) that combines different approaches based on a different use of the cluster data of the queries. The official results shown that the combination between the title of each query ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inteligencia Artificial, Revista Iberoamericana de Inteligencia Artificial

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2004